Segmentation of spoken dialogue by interjections, disfluent utterances and pauses
Authors
Abstract
This paper attempts to segment the spontaneous speech of human-to-human spoken dialogues into relatively large units of speech, that is, sub-phrasal units delimited by interjections, disfluent utterances and pauses. A spontaneous speech model incorporating prosody was developed, in which three kinds of speech segment models and the transition probabilities among them were specified. The segmentation experiments showed that 87.6% of the segment boundaries were located correctly within 50 msec and 81.2% within 30 msec, a 10.1-point increase in performance compared with the initial model without prosodic information.
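The tolerance-based figures above (boundaries located within 50 msec or 30 msec) can be illustrated with a minimal sketch. The abstract does not describe the scoring procedure, so everything here is an assumption: the function name `boundary_hit_rate`, the boundary times in milliseconds, and the simplification that a reference boundary counts as correct whenever any hypothesized boundary lies within the tolerance (no one-to-one matching is enforced).

```python
from bisect import bisect_left


def boundary_hit_rate(reference, hypothesis, tolerance):
    """Fraction of reference boundaries that have a hypothesized boundary
    within `tolerance` (same time unit as the boundaries).

    Hypothetical helper for illustration; not the paper's scoring code.
    """
    if not reference:
        return 0.0
    hyp = sorted(hypothesis)
    hits = 0
    for t in reference:
        # The nearest hypothesized boundary is either just before or
        # just after the insertion point of t in the sorted list.
        i = bisect_left(hyp, t)
        candidates = hyp[max(i - 1, 0):i + 1]
        nearest = min((abs(h - t) for h in candidates), default=float("inf"))
        if nearest <= tolerance:
            hits += 1
    return hits / len(reference)


# Made-up boundary times in milliseconds, for illustration only.
reference = [120, 480, 910, 1500]
hypothesis = [130, 520, 905, 1580]

for tol in (50, 30):
    rate = boundary_hit_rate(reference, hypothesis, tol)
    print(f"within {tol} msec: {rate:.1%}")
```

With this made-up data the offsets are 10, 40, 5 and 80 msec, so three of four boundaries fall within 50 msec and two of four within 30 msec. A stricter evaluation would match each hypothesized boundary to at most one reference boundary.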
Similar resources
Segmentation of Spoken Dialogue by Interjections, Disfluent Utterances and Pauses
This paper attempts to segment spontaneous speech of human-to-human spoken dialogues into a relatively large unit of speech, that is, a sub-phrasal unit segmented by interjections, disfluent utterances and pauses. A spontaneous speech model incorporating prosody was developed, in which three kinds of speech segment models and the transition probabilities among them were specified. The segmentatio...
Disfluency detection in a dialogue system
Disfluency detection is the task of recognizing structural metadata in spoken utterances. It has been the topic of several studies in computational linguistics and psycholinguistics. This paper motivates the need for automatic disfluency detection in a dialogue system and delineates some of the features that characterize a disfluent utterance.
Listening to the sound of silence: Investigating the consequences of disfluent silent pauses in speech for listeners
Silent pauses are a common form of disfluency in speech yet little attention has been paid to them in the psycholinguistic literature. The present paper investigates the consequences of such silences for listeners, using an Event-Related Potential (ERP) paradigm. Participants heard utterances ending in predictable or unpredictable words, some of which included a disfluent silence before the tar...
An Integrated Approach to Robust Processing of Situated Spoken Dialogue
Spoken dialogue is notoriously hard to process with standard NLP technologies. Natural spoken dialogue is replete with disfluent, partial, elided or ungrammatical utterances, all of which are difficult to accommodate in a dialogue system. Furthermore, speech recognition is known to be a highly error-prone task, especially for complex, open-ended domains. The combination of these two problems – ...
Robust Processing of Situated Spoken Dialogue
Spoken dialogue is notoriously hard to process with standard language processing technologies. Dialogue systems must indeed meet two major challenges. First, natural spoken dialogue is replete with disfluent, partial, elided or ungrammatical utterances. Second, speech recognition remains a highly error-prone task, especially for complex, open-ended domains. We present an integrated approach for ...